Overview

Dataset Statistics

Number of Variables 17
Number of Rows 11162
Missing Cells 25
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 7.8 MB
Average Row Size in Memory 736.5 B
Variable Types
  • Numerical: 6
  • Categorical: 11

Dataset Insights

duration is skewed Skewed
campaign is skewed Skewed
pdays is skewed Skewed
previous is skewed Skewed
balance has a high cardinality: 3802 distinct values High Cardinality
month has constant length 3 Constant Length
pdays has 8324 (74.57%) negatives Negatives
previous has 8324 (74.57%) zeros Zeros

Variables


age

numerical

Approximate Distinct Count 76
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 174.4 KB
Mean 41.2319
Minimum 18
Maximum 95
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • age is skewed right (γ1 = 0.8627)

Quantile Statistics

Minimum 18
5-th Percentile 26
Q1 32
Median 39
Q3 49
95-th Percentile 61
Maximum 95
Range 77
IQR 17

Descriptive Statistics

Mean 41.2319
Standard Deviation 11.9134
Variance 141.9284
Sum 460231
Skewness 0.8627
Kurtosis 0.6207
Coefficient of Variation 0.2889
  • age is not normally distributed (p-value 0.0023021823525419777)
  • age has 171 outliers

job

categorical

Approximate Distinct Count 12
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 810.4 KB

Length

Mean 9.3491
Standard Deviation 1.8538
Median 10
Minimum 6
Maximum 13

Sample

1st row admin.
2nd row admin.
3rd row technician
4th row services
5th row admin.

Letter

Count 100672
Lowercase Letter 100672
Space Separator 0
Uppercase Letter 0
Dash Punctuation 2349
Decimal Number 0

marital

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 782.7 KB
  • The largest value (married) is over 1.81 times larger than the second largest value (single)

Length

Mean 6.8007
Standard Deviation 0.6256
Median 7
Minimum 6
Maximum 8

Sample

1st row married
2nd row married
3rd row married
4th row married
5th row married

Letter

Count 75909
Lowercase Letter 75909
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (married, single) take over 50.0%
  • The largest value (married) is over 1.81 times larger than the second largest value (single)

education

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 799.1 KB

Length

Mean 8.3117
Standard Deviation 0.7566
Median 8
Minimum 7
Maximum 9

Sample

1st row secondary
2nd row secondary
3rd row secondary
4th row secondary
5th row tertiary

Letter

Count 92775
Lowercase Letter 92775
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (secondary, tertiary) take over 50.0%

default

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 730.5 KB
  • The largest value (no) is over 65.44 times larger than the second largest value (yes)

Length

Mean 2.0151
Standard Deviation 0.1218
Median 2
Minimum 2
Maximum 3

Sample

1st row no
2nd row no
3rd row no
4th row no
5th row no

Letter

Count 22492
Lowercase Letter 22492
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (no, yes) take over 50.0%

balance

categorical

Approximate Distinct Count 3802
Approximate Unique (%) 34.1%
Missing 25
Missing (%) 0.2%
Memory Size 818.6 KB
  • The largest value (0 $ ) is over 19.85 times larger than the second largest value ( 1,00 $ )

Length

Mean 10.2624
Standard Deviation 1.8482
Median 10
Minimum 5
Maximum 13

Sample

1st row 2 343,00 $
2nd row 45,00 $
3rd row 1 270,00 $
4th row 2 476,00 $
5th row 184,00 $

Letter

Count 0
Lowercase Letter 0
Space Separator 36787
Uppercase Letter 0
Dash Punctuation 686
Decimal Number 55319
  • balance contains many words: 1138 words
  • The largest value (1) is over 1.87 times larger than the second largest value (2)

housing

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 735.5 KB

Length

Mean 2.4731
Standard Deviation 0.4993
Median 2
Minimum 2
Maximum 3

Sample

1st row yes
2nd row no
3rd row yes
4th row yes
5th row no

Letter

Count 27605
Lowercase Letter 27605
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (no, yes) take over 50.0%

loan

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 731.8 KB
  • The largest value (no) is over 6.65 times larger than the second largest value (yes)

Length

Mean 2.1308
Standard Deviation 0.3372
Median 2
Minimum 2
Maximum 3

Sample

1st row no
2nd row no
3rd row no
4th row no
5th row no

Letter

Count 23784
Lowercase Letter 23784
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (no, yes) take over 50.0%

contact

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 794.2 KB
  • The largest value (cellular) is over 3.43 times larger than the second largest value (unknown)

Length

Mean 7.8592
Standard Deviation 0.5096
Median 8
Minimum 7
Maximum 9

Sample

1st row unknown
2nd row unknown
3rd row unknown
4th row unknown
5th row unknown

Letter

Count 87724
Lowercase Letter 87724
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (cellular, unknown) take over 50.0%
  • The largest value (cellular) is over 3.43 times larger than the second largest value (unknown)

day

numerical

Approximate Distinct Count 31
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 174.4 KB
Mean 15.658
Minimum 1
Maximum 31
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • day is skewed right (γ1 = 0.1113)

Quantile Statistics

Minimum 1
5-th Percentile 3
Q1 8
Median 15
Q3 21
95-th Percentile 30
Maximum 31
Range 30
IQR 13

Descriptive Statistics

Mean 15.658
Standard Deviation 8.4207
Variance 70.9089
Sum 174775
Skewness 0.1113
Kurtosis -1.0614
Coefficient of Variation 0.5378
  • day is not normally distributed (p-value 3.632630906173671e-17)

month

categorical

Approximate Distinct Count 12
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 741.2 KB
  • The largest value (may) is over 1.86 times larger than the second largest value (aug)

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row may
2nd row may
3rd row may
4th row may
5th row may

Letter

Count 33486
Lowercase Letter 33486
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The largest value (may) is over 1.86 times larger than the second largest value (aug)
  • month has words of constant length

duration

numerical

Approximate Distinct Count 1428
Approximate Unique (%) 12.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 174.4 KB
Mean 371.9938
Minimum 2
Maximum 3881
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • duration is skewed right (γ1 = 2.1434)

Quantile Statistics

Minimum 2
5-th Percentile 51
Q1 138
Median 255
Q3 496
95-th Percentile 1079.9
Maximum 3881
Range 3879
IQR 358

Descriptive Statistics

Mean 371.9938
Standard Deviation 347.1284
Variance 120498.1162
Sum 4.1522e+06
Skewness 2.1434
Kurtosis 7.2975
Coefficient of Variation 0.9332
  • duration is not normally distributed (p-value 9.120924128296628e-11)
  • duration has 636 outliers

campaign

numerical

Approximate Distinct Count 36
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 174.4 KB
Mean 2.5084
Minimum 1
Maximum 63
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • campaign is skewed right (γ1 = 5.5448)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 1
Median 2
Q3 3
95-th Percentile 7
Maximum 63
Range 62
IQR 2

Descriptive Statistics

Mean 2.5084
Standard Deviation 2.7221
Variance 7.4097
Sum 27999
Skewness 5.5448
Kurtosis 57.3635
Coefficient of Variation 1.0852
  • campaign is not normally distributed (p-value 3.423052336532046e-24)
  • campaign has 601 outliers

pdays

numerical

Approximate Distinct Count 472
Approximate Unique (%) 4.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 174.4 KB
Mean 51.3304
Minimum -1
Maximum 854
Zeros 0
Zeros (%) 0.0%
Negatives 8324
Negatives (%) 74.6%
  • pdays is skewed right (γ1 = 2.4497)

Quantile Statistics

Minimum -1
5-th Percentile -1
Q1 -1
Median -1
Q3 20.75
95-th Percentile 326
Maximum 854
Range 855
IQR 21.75

Descriptive Statistics

Mean 51.3304
Standard Deviation 108.7583
Variance 11828.3639
Sum 572950
Skewness 2.4497
Kurtosis 6.8348
Coefficient of Variation 2.1188
  • pdays is not normally distributed (p-value 6.789104740803461e-25)
  • pdays has 2750 outliers

previous

numerical

Approximate Distinct Count 34
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 174.4 KB
Mean 0.8326
Minimum 0
Maximum 58
Zeros 8324
Zeros (%) 74.6%
Negatives 0
Negatives (%) 0.0%
  • previous is skewed right (γ1 = 7.3343)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 1
95-th Percentile 5
Maximum 58
Range 58
IQR 1

Descriptive Statistics

Mean 0.8326
Standard Deviation 2.292
Variance 5.2533
Sum 9293
Skewness 7.3343
Kurtosis 106.1497
Coefficient of Variation 2.753
  • previous is not normally distributed (p-value 6.720003943382288e-25)
  • previous has 1258 outliers

poutcome

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 783.8 KB
  • The largest value (unknown) is over 6.78 times larger than the second largest value (failure)

Length

Mean 6.9038
Standard Deviation 0.428
Median 7
Minimum 5
Maximum 7

Sample

1st row unknown
2nd row unknown
3rd row unknown
4th row unknown
5th row unknown

Letter

Count 77060
Lowercase Letter 77060
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (unknown, failure) take over 50.0%
  • The largest value (unknown) is over 6.78 times larger than the second largest value (failure)

deposit

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 735.5 KB

Length

Mean 2.4738
Standard Deviation 0.4993
Median 2
Minimum 2
Maximum 3

Sample

1st row yes
2nd row yes
3rd row yes
4th row yes
5th row yes

Letter

Count 27613
Lowercase Letter 27613
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (no, yes) take over 50.0%

Interactions

Correlations

Missing Values